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have been amended. No new matter has been added. Upon entry of the present amendment, claims 
27-29, 35-41 and 43-5 1 will remain pending. Because the amendments to the claims remove issues 
for appeal (i.e. , indefiniteness and anticipation rejections), Applicants respectfully request that they 
be entered into the record. See, M.P.E.P. § 714.12. 

As a preliminary matter, Applicants acknowledge receipt of the "Attachment for PTO-948" 
outlining changes for prosecution of applications containing drawings. Formal drawings have been 
filed on date even herewith under separate cover to the Draftsperson. Original Figures 6-8, 1 1, 12, 
16,17, 19, 2 1-24, 26, 27, 29-65 and 67-97, which are not essential subject matter nor are required to 
comply with any portion of 35 U.S.C. § 112, have been deleted from the present application. The 
specification has been amended throughout* take into consideration the deleted figures. In addition, 
because no amino acid or nucleotide sequences are present in the specification or drawings, as 
currently amended, there is no longer a requirement for a Sequence Listing. 

Claims 35 and 43 have been amended to recite "RNA of a selected organism" rather than 
"target nucleic acid." The amendment is made to harmonize the language of the claims. No new 
matter is added and no change in scope is intended. 

Claims 35 and 5 1 have been amended to proviso out the 3 1 untranslated region of the histone 
mRNA. As amended, the "molecular interaction site" of the claims does not include the 3' 
untranslated region of the histone mRNA. Support for the amendment is found, for instance, in 
Example 3 uc pages 37-38 of the specification. No new matter is added. 

I. The Claims Are Clear And Definite 

Claims 35-41 and 43-50 stand rejected under 35 U.S.C. § 112, second paragraph, as 
allegedly being indefinite for failing to particularly point out and distinctly claim the subject matter 
which Applicants regard as their invention. The Office Action asserts that the phrase "said target 
nucleic acid" can still be found in claims 35, 43 and 5 1 . Applicants intended to amend each of claims 
35-41 and 43-50 to recite "RNA of a selected organism" in place of "target nucleic acid" in 
Applicants' Amendment and Request for Reconsideration, dated July 24, 2000. However, as pointed 
out by the Office Action, Applicants inadvertently neglected to amend all of the occurrences of the 
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phrase. Claims 35, 43 and 51 have been amended according., In view of «he amendment to the 



withdrawn. 



II. The Claimed Invention Is Novel and Not Obvious 
A The Williams Reference 

Cairns 35-4. and 43-51 stand rcjected under 35 U.S.C. § 102(b) as alleged,, bemg 
anticipated by Williams - a,., M*. ««. « » 4660-4666 (heretnafter the "Williams 

of the histone mRNA constitutes a molecular interaction site. Applicants have amended clatms 
an<i 5! to recite tha, the oligonucleottde does no, comprise the V untranslated region of the h.s.one 
mRNA. The Wil.iams reference fails to teach every feature recited in claims 3*4 an - 
amended. A^nily.AppHe^-^^^W^^^"^ 11 ^^ 

withdrawn. 

B. The Garcia Reference 

Claims 27-29 and 35 stand rejected under 35 U.S.C. § 102(b) as allegedly being anticpated 
bv or in the alternative, under 35 U.S.C. § 103(a) as allegedly being obvious over Garca e, a,., J. 

feature recited in the claims. f 

The Garcia reference fails to teach or suggest at least two aspects of apphcants darned 
invention. Firs, the Garcia reference fails to teach or suggest an oligonucleotide, let alone an 
oUgo „uc,eotide,ha,c„m P nsesamo.ecu,ar interaction site, as recited in independent clatms 27 and 
35 Rather, the Garcia reference reports that the ribosome binding domain of E. coU translation 
ini aaUon factor fF3 intetacts with 1«S RNA. However, the 16S RNA molecule that is relied on for 
theinstantrejection is not an oligonucleoUde, as called for by the claims. Applicants' clatms clearly 
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distinguish namrally-occurring nucleic acid molecules, such as prok*yo,ic RNA. from 
oligonucleotides. Indeed, one of ordinary skill in the art would understand .ha. the oligonucleottdes 
of.heinvenUonarepreparedbase.onsuchRNAmolecules.NosucholigonucleoUdeisprepared.n, 

ortaughtby.theGarciareference and, thus, the reference does no. anticipate claimed invenuon. 

The Garcia reference further fails to teach or suggest that the molecular interact™ sue of 
the 16S RNA is present in "a. leas, one additional prokaryotic RNA," as recited in the claims. The 

the 16S RNA molecule, and not with any other RNA. 

Finally with respec. to the al.erna.ive § 103(a) rejection, the Office Action fails to make out 
a prima facie showing of obviousness. In establishing .prima facie case of obviousness under 35 
USC § 103 it is incumbent upon the Examiner to provide a reason why one of ordinarysk.il ,n the 
^would have been led to modifyaprior art referenceorto combine reference teachings .oamve a. 

,„e claimed invention. E, parte Clapp, Til U.S.P.Q. 972 (Bd. Pat. A PP . In,. 1985). To this end, .he 
requisite mo.iva.ion mus. s.em from some .eaching, sugges.ion or inference in .he pnor art as a 
whole or from .he knowledge generally available to one of ordinary skill in .he art and no. from 
appellants' disclosure. See, for example, Uniroyal ,nc. , RuSUn-WUey Corp.. 5 U.S.P.Q.2d MM 
(FedCir ,988);and ap a r r e N e «,25U.S.P.Q.2d.8.7,.819(Bd.Pa..App.In,.1992).Here,,he 

Office Action fails .o provide a reason as .o why one of ordinal skil. in .heart wouldhave been led 
,„ modify .he ribosoma, subuni, of .he Garcia reference .0 arrive a. the oligonucleotide of clatmed 
invention Nor does the Office Action point <o any reference .ha. discloses .he purported molecular 
i„,erac.ion site of .he ribosomal subuni. is found in an additional prokaryotic RNA. Absen, such 
.eachings .he rejection of the claims as allegedly being obvious mus. be withdrawn. 

Applicants respectfully reques. that the rejection under 35 U.S.C. § 102(b) or, alternatively, 

35 U.S.C. § 103(a) be withdrawn. 
C. The Gutell Reference 

Claims 35-41 and 43-50 stand rejected under 35 U.S.C. § 102(b) as allegedly being 
anticipatedbyor.in the alternative, under 35 U.S.C. § 103(a) as allegedly.being obvious over Gutell 

35 



PATENT 

DOCKET NO.: IBIS-0012 

suggest every feature recited in the claims. 

rRNAs for E. coU, yeast, ana C. The Office Action improper,, asserts a scient.fic 

llusion that is neither taught nor proposed in the Gutei, reference. The Office Action assets a. 

r „m . different organisms and . . . compared their suture second structure. The Gu* 
„ clearly sta.es the ooieCves of the su.ec, paper a. page 305, H g , co umn u^er 
O^^sentourmo^ 

bouom of right column: "mi.ustrated in this article are three divergent 16S and 16WA. rRNAs 
t eachesawayfrom.heconc,usionsasser«edin,heOfficeAcUonat m efoo,no,e,oF,gure2w h c«,n,t 

" a, struct were manipuiated "manua.lv," through the use of an "interacttve se,u nee 

Tnl o study the potentia, aiignment o, the suojec, regions of rRNA across species, hut rat^ 
I ed to pLn, "their" structure for the - » - • 

ritLeiementofciaim^.inpanicuiarridentif.ngat.eastones^uencere.onwh.ch.s 

conserved among said plurality of nucleic acids and said RNA of a se.ec.eO orgamsm. 

fc addiL, as record in the 0ff.ee Action, the Ou.eU reference does no. ,den„f y a 
molecular interaction site or determine the secondary stnacture of a conserved region w.thm d» 
^ The Office Action mis^nly assets, however, .ha. these features a. simply .nhere t 
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offer evidence that .he inherent characteristics would necessarily (i.e.. aiways) be present in an 

J address .he lac* of evidence o, record, and, according,, AppHcants request .ha. .he reject™ 

based on the Gutell reference be withdrawn. 

Further the ribosomal RNA of the Often reference does not anticipate the ohgonudeoude 

canedforinApphcan.s.ciain*.^ 

rnoiecuiarin.erac.ionsiteonUtetSSstructureisfoundinoneadditionaiprocaryoncRNA.Tha.e 
RNA disciosed in the Gutel, reference can be found in "differen, organisms," as poimed out ,n 
Office Action, is of no consequence, and fai.s «o briag .he .eachings of the GuteU reference w,.h,n 
the scope of Applicants' claims. 

Finally for .he same reasons se, forth above with respect to .he Garca reference, .he Office 
Ac ,ion fails to ma k e ou. a case of obviousness. According, y Applicant, respect^ 

reqU est that the rejection under 35 U.S.C. , 102(b) o, alternatively, 35 U.S.C. , 103(a) be 
withdrawn. 
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HI. Conclusion JV -~.fi™. 

In v i ewof«hefo re gomg.AppHca n ts reS pectf U Uysubm i t.hac«heclaim S a rel nco„d 1 .,onfor 

aUowance. An eariy notice of .he same is earnest,, soUcited. The Examiner is invited to contact 
Appiicants' undersign* representative at (2.5) 564-8906 if there are any questions regard* 
App.icants' ciaimed invention. Attached hereto is a mar^p version of the changes made to , 
Ration and Cairns by the current amendment. The attached page is captioned ■•V er3M! h 
markings to si""" ^hanpes made." 



Date: December 13, 2001 

WOODCOCK WASHBURN LLP 
One Liberty Place - 46th Floor 
Philadelphia, PA 19103 
Telephone: (215) 568-3100 
Facsimile: (215) 568-3439 



Respectfully submitted, 




Paul K. Legaard 
Registration No. 38,534 
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VERSION WITH MARKINGS TO SHOW CHANGES MADE 

In the Applica<'° n: 

Pages 1-96 containing the Sequence Listing have been deleted. 



In the Specification: 

Paragraph beginning at page 5, line 21 of the specification has been deleted. 

Paragraph beginning a, page 5, line 23 of the speeification has been deleted. 

Paragraph beginning a, page 5, line 25 of the specification has been deleted. 

Paragraph beginning at page 5, line 28 of the specification has been amended as follows: 
Figure 6 [9] shows an exemplary descriptor. 

Paragraph beginning at page 5, line 29 of the specification has been amended as follows: 
Figure 7 [10] shows a set of e-value scores for ferritin. 

Paragraph beginning at page 6, line 1 of the specification has been deleted. 

Paragraph beginning at page 6, line 3 of the specification has been deleted. 

Paragraph beginning at page 6, line 5 of the specification has been amended as follows: 
Figure 8 [13] shows a representative lookup table used in Q-compare r- 
CompareOverWins. 



or 



Paragraph beginning at page 6, line 7 of the specification has been amended as follows: 
Figure 9 [14] shows a representative block diagram of a program called RevComp. 
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Paragraph beginning at page 6, line 8 of the specification has been amended as follows: 
Figure 10 [15] shows a representative flow chart showing preferred steps of a preferred 
database search strategy for ortholog finding. 

Paragraph beginning at page 6, line 10 of the specification has been deleted. 

Paragraph beginning at page 6, line 12 of the specification has been deleted. 

Paragraph beginning at page 6, line 14 of the specification has been amended as follows: 
Figure ii [18] shows a representative flow scheme showing preferred steps for a 
preferred SEALS strategy. 

Paragraph beginning at page 6, line 16 of the specification has been deleted. 

Paragraph beginning at page 6, line 18 of the specification has been amended as follows: 
Figure 12 [20] represents a genetic map showing a conserved iron response element ,n the 
5'UTR of mouse and human ferritin. 

Paragraph beginning at page 6, line 20 of the specification has been deleted. 
Paragraph beginning at page 6, line 22 of the specification has been deleted. 
Paragraph beginning at page 6, line 24 of the specification has been deleted. 

Paragraph beginning at page 6, line 25 of the specification has been deleted. 

Paragraph beginning at page 6, line 26 of the specification has been amended as follows: 
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Figure 13 [25] shows representative flow scheme showing preferred steps for a preferred 

Structure Predictor strategy. 

Paragraph beginning at page 6, line 28 of the specification has been deleted. 
Paragraph beginning at page 6, line 30 of the specification has been deleted. 

Paragraph beginning at page 7, line 1 of the specification has been amended as follows: 
Figure 14 [28] shows a representative structure drawing of ferritin 5 UTR. 

Paragraph beginning a, page 7, line 2 of the specification has been deleted. 

Paragraph beginning a. page 7, line 3 of the specification has been deleted. 

Paragraph beginning a, page 7. line 5 of the specification has been deleted. 

Paragraph beginning at page 7, line 6 of the specification has been deleted. 

Paragraph beginning at page 7, line 7 of the specification has been deleted. 

Paragraph beginning a. page 7, line 9 of the specification has been deleted. 

Paragraph beginning at page 7, line 11 of the specification has been deleted. 

Paragraph beginning a. page 7. line 12 of the specification has been deleted. 

Paragraph beginning a. page 7, line 14 of the specification has been deleted. 
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Paragraph beginning at page 7, line 15 of the specification has been deleted. 

Paragraph beginning at page 7, line 16 of the specification has been deleted. 
Paragraph beginning at page 7, line 18 of the specification has been deleted. 
Paragraph beginning at page 7, line 20 of the specification has been deleted. 
Paragraph beginning at page 7, line 22 of the specification has been deleted. 
Paragraph beginning at page 7, line 23 of the specification has been deleted. 

Paragraph beginning at page 7, line 25 of the specification has been deleted. 

Paragraph beginning at page 7, line 27 of the specification has been deleted. 

Paragraph beginning at page 7, line 28 of the specification has been deleted. 

Paragraph beginning at page 7, line 30 of the specification has been deleted. 

Page 8 has been deleted. 

Paragraph beginning at page 9, line 1 of the specification has been deleted. 
Paragraph beginning at page 9, line 3 of the specification has been deleted. 
Paragraph beginning at page 9, line 5 of the specification has been deleted. 
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Paragraph beginning at page 9, line 7 of the specification has been amended as follows: 
Figure 15 [66] shows a representative mass-spec structure probe analysis of region 1 of 
ornithine decarboxylase 3' UTR. 

Paragraph beginning at page 9, line 9 of the specification has been deleted. 
Paragraph beginning at page 9, line 1 1 of the specification has been deleted. 
Paragraph beginning at page 9, line 13 of the specification has been deleted. 
Paragraph beginning at page 9, line 15 of the specification has been deleted. 
Paragraph beginning at page 9, line 17 of the specification has been deleted. 
Paragraph beginning at page 9, line 19 of the specification has been deleted. 

• t 

Paragraph beginning at page 9, line 21 of the specification has been deleted. 
Paragraph beginning at page 9, line 23 of the specification has been deleted. 
Paragraph beginning at page 9, line 25 of the specification has been deleted. 
Paragraph beginning at page 9, line 27 of the specification has been deleted. 
Paragraph beginning at page 9, line 29 of the specification has been deleted. 
Paragraph beginning at page 9, line 30 of the specification has been deleted. 
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Paragraph beginning at page 10, line 1 of the specification has been deleted. 

Paragraph beginning at page 10, line 3 of the specification has been deleted. 
Paragraph beginning at page 10, line 5 of the specification has been deleted. 
Paragraph beginning at page 10, line 6 of the specification has been deleted. 
Paragraph beginning at page 10, line 8 of the specification has been deleted. 
Paragraph beginning at page 10, line 10 of the specification has been deleted. 
Paragraph beginning at page 10, line 1 1 of the specification has been deleted. 
Paragraph beginning at page 10, line 12 of the specification has been deleted. 
Paragraph beginning at page 10, line 14 of the specification has been deleted. 
Paragraph beginning at page 10, line 16 of the specification has been deleted. 
Paragraph beginning at page 10, line 17 of the specification has been deleted. 
Paragraph beginning at page 10, line 18 of the specification has been deleted. 
Paragraph beginning at page 10, line 19 of the specification has been deleted. 
Paragraph beginning at page 10, line 20 of the specification has been deleted. 
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Paragraph beginning at page 10, line 21 of the specification has been deleted. 

Paragraph beginning at page 10, line 22 of the specification has been deleted. 

Paragraph beginning a. page 10, line 23 of the specification has been deleted. 

Paragraph beginning a. page 10, line 24 of the specification has been deleted. 

Paragraph beginning at page 10, line 25 of the specification has been deleted. 

Paragraph beginning at page 16, line 1 1 of the specification has been amended as follows: 
Additional nucleic acid targets may be determined independently or can be selected from 
puoticly available prokaryotic and eukaryotic genetic databases known to those skilled in the art. 

which is a database of genetic mutations associated with disease, was developed, ,n par,, d. 
National Center for Biotechnology Information (NCBI). BMM««» 

^^77 0 r example, httpJ/w^b^gov/OminV., CGAP. wh.ch ,s an 
htt ,„cb,n,m.nih,ov OT 

or partial nucleotide sequences. In addition, nuc.eic acid targets can also be se,ected *» pnv* 
can be determined especially for use in connection with the present invention. 

Paragraph beginning at page !6, line 25 of the specification has been amended as follows: 
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Ate . nucleic acid targe, is select or provided. the nucleotide sequence of the nucleic 
acid targe, is determined and then compared to the nucleotide sequences of a plurality of nuc,,c 
acids from different — ic specie, In one embedment of the invention, the nucleotide 
sequence of the nucleic acid targe, is determined by scanning a, least one genetic database or ,s 

art induce, for example, the Expressed Gene Anatomy Database (EGAD) and Unigene-Homo 
Sapiensda«abase(Uni g cne),GenBa„Mnd,he,*e.EGADcontainsano„-redundantse«ofhuman 

.ranscrip. (HT) sequences [and can be accessed through the Internet a, for example 
hB p //www,igr.org/.db/egad/egad.h,ml] intoWMM thrnnrh the In te nt at the world 
„ r„, r-r- ■T-T'^-H^d.html. Umgene is a system for au.omauca.ly 
partitioningGenBanlcsequences into a non-redundant set of gene-oriented clusters. Each Umgene 
cluster contains sequences that represent a unique gene, as well as related information such as the 
tissue types in which the gene has been expressed and map location. 

Paragraph beginning a, page 17. line 10 of the specification has been amended as follows: 
In addition, Unigene contains hundreds of thousands of novel expressed sequence tag (EST) 
sequences. [Unigene can be accessed through the Interne, at, for. example, 
n „ p ,/www.ncbi.nlm.nih.gov/UniGcne/.] Uju^eisjuHh W "W "» 

,„ fnr -- r .-^^^n, h ^,niGene/. These databases can be use ,n 

connection with searching programs such as, for examp!e. Entrez, which is known and available to 
.nose skilled in the art, and the like. [Entrez can be accessed through the Interne, a, for example 

hllp .//www.ncbi.nlm.nih.gov/En.rez/.]r ■ r nh|iHv.v 1 ilahHhmu f h.heIn.emexaUtewprld 

M^jnr--r— entrez- Preferably, the most complete nucle,c ac,d 

sequencerepresemationavailable from various daiabases is used. The GenBank da.abase, wh,ch ,s 
known and available to those skilled in the art, can also be used to obtain .he most complete 
nucleotide sequence. GenBank is the NM genetic sequence database and is an annotated collection 
of all publicly available DNA sequences. GenBank is described in, for example, Nuc. Acds Res., 
1998. 26, 1-7, which is incorporated herein by reference in its entirety, and can be accessed by .hose 
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sWnedinmeanlthroughmelnten^ 

tatau-a ton-'v ' ' w " rld 

m| t|| „„, r .. r .K^K a nW,n ri ex.h 1 ml. AUematively.partalnuclecidesequencesof nudcc 
acid targets can be used when a complete nucleotide sequence is not available. 

Paragraph beginning at page 17, line 25 of the specification has been amended as follow: 
,„ another embodiment of the present invention, the nucleotide sequence of the nucle,c 
acid targe, is determined by assembling a plurality of overlapping expressed sequence tags 
(ESTs) The EST database (dbEST), which is known and available to those skilled ,n the art, 
comprises approximately one mUlion different human mRNA sequences comprising from about 
500 to 1000 nucleotides, and various number, of ESTs from a number of different orgamsms. 
[dbEST can be accessed through the Internet at, for example, http://www.ncbi.nlm.mh.gov/ 
dbEST/index.h,ml.] d bFST is publicly available .hw* If Internet „ ,he world wi dewebaj, 
^^^: r , rr ,K r ^T/,ndex.h,ml. These sequences are derived from a clonmg 
strategy that uses cDNA expression clones for genome sequencing. ESTs have applications ,n 
me discovery of new genes, mapping of genomes, and identification of coding regions » 
genomic sequences. Another important feature of EST sequence information that is becommg 
rapidly available is tissue-specific gene expression data. This can be extremely useful » 
targeting selective gene(s) for therapeutic intervention. Since EST sequences are relatively short, 
,hey must be assembled in order to provide a complete sequence. Because every avai.able Cone 
is sequenced, it results in a number of overlapping regions being reported in the database. 

Paragraph beginning a, page 18, line 9 of the specification has been amended as follows: 
AssemblyofoverlappingESTsextendedalongboththeS'andS'directionsresuteinafull- 

leng ,h -virtual transcript." The resultant virtual transcript may reptesen, an already charactered 
„„c,eic acid or may be a novel nucleic acid with no known bio^ca, function. The Institute for 
GenomicResearch^GWHumanGenomelndexCHGDdatabase, which is known and avatlableo 
.hose skffled in the art, contains a .is, of human transcripts. [TIGR car, be accessed through the 
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T .HatfLampJe^^ 
h ln,b>er is a ,00, for assembling large - of overling seouence data s U ch as 

fst Blast [PHRAP can be accessed through the Internet at, 
^^^^^7 , u , the pitted steps set for* » Ftgure 2. 

q ll s^ — - be performed — , »r bv usinS ^Uv^ 
compu ter plgrams .own .o those skilled in the a, Preferablv, - - = — 
, -, hms which are available and known to those skilled ,n the art, and the hke can be use 

:r ::i — , - - . — -» - -rrr: 

„ t h ,es TBlast can be accessed through the Internet at, for example, 
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database GCOP.c lt agev.9.0 i sacommercia 1 lyav^ab.e software package that contains over! 00 
interrelated software programs mat enables a„a,ys,s of sequences by editing, mapping, comparmg 
and aligning mem. Other programs included in the GCG Package include, for example, programs 
which faciiitate RNA secondary structure predictions, nuc.eic acid fragment assem* and 
evolutionary analysis. In addition, the most prominent genetic databases (GenBank, EMM. HR. 
and SWISS-PROT) are distributed along with the GCG Package and are fully access.ble wuh the 
aatabase searching and manipulation programs. [GCG can be accessed through the Interne, a,, for 

example, http://www.gcg.com/.] W « 

MaLSMmMmL Fetch is a tool available in GCG that can get annotated GenBank 
rccords basedon accession numbe. and is similar to En tre z. Another sequence simtlanty searched, 
be performed with GeneWor.d and GeneThesaurus from Pangea. GeneWorld 2.5 is a, automated, 
flexible, high-throughpu, application for analysis of polynucleotide and protein sequence, 
GeneWorld allows for automatic analysis and annotations of sequences. Uke GCG, GeneWorld 
incorporates several tools for homology searching, gene finding, multiple sequence alignment 
secondary structure prediction, and motif identification. GeneThesaurus l.Otm is a sequence and 
annotation data subscription service providing information from multiple sources, proving a 
relational data model for public and local data. 

Paragraph beginning at page 20, line 24 of the specification has been amended as follows: 
Another toolkit capable of doing sequence similarity searching and data manipulation » 
SEALS also from NCBI. This tool set is written in perl and C and can run on any computer 
platform that supports these languages. [It is available for download, for example, at: 
hl , p .//www.ncbi.nlm.nih.gov/Walker/SEALS/.] J . I, ^ ™> N» ^ the "-M ""* 
MttaaMriuLjfc » m n tB nchi.nlm.nih.goWW^SEALaThis toolkit provides access 
,„ Blas.2 or gapped blast. It also includes a tool called ^collector which, in conjunction w,.h a 
.ool called <ax_break, parses the output of B,as.2 and returns the identifier of the sequence most 
homologous to the query sequence for each species present. Another useful too, is fea.ur=2fasta 
which extracts sequence fragments from an input sequence based on the. annotation. An 
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exemplary use for this tool is to create sequence files containing the 5' untranslated region of a 

cDNA sequence. 

Paragraph beginning a. page 21 , line 30 of .he specification has been amended as follows: 
fc another embodiment of the invention, the sequences required are obtained by searchmg 
ortholog database, One such database is Hovergen, which is a curated database of vertebrate 
orthologs. Ortholog sets may be exported from this database and used as is, or used as seed, for 
further sequence similarity searches as described above. Further searches may be des.red, for 
example, to find invertebrate orthologs. [Hovergen can be downloaded, for example, at: 
ft p.//pbiUniv-lyonl.fr/puWhovergen/.l Hovesguuul.lirly lhB . 

J JJLLimrrr — ^^w^ A toabase of prol 7 0,,c 

ormologs COGS, is available and can be used interactively ,hro„ r h the Interne, a. the w v rl d 
ssas ^ tiU2r ^ mo ^ [on the internet, for example at: h«.p://wwwj ncbi.nlm.nih.gov/COG/. 

Paragraphbeginning a. page 23, line 22 of the specification has been amended as follows: 
Sequence homology between the window sequence of the target nucleic actd and me 
query sequence of any of the plurality of nucleic acid sequences obtained as described above, ,s 
preferably at least 60%, more preferably a. leas, 70%, more preferably a. leas. 80%, and most 
preferably a. leas. 90%. The most preferable method of choosing the threshold is to have the 
computer automatically try all thresholds from 50% to 100% and choose a threshold based a 
metric provided by the user. One such metric is to pick the threshold such .ha, exactly n has are 
returned, where n is usually set to 3. This process is repeated until every base on the query 
nucleic acid, which is a member of the plurality of nucleic acids described above, has been 
compared to every base on the master target sequence. The resulting scoring matnx can be 
plotted as a scatier plo, Based on the match densi.y a, a given location, .here may be no dots, 
isohted dots, or a se. of dots so close ,oge.her that they appear as a line. The presence of Imes, 
however small, indicates primary sequence homology. [A representative scatter plot 
interspecies sequence comparison is depicted in Figure 6.) Sequence conservation within nuclcc 
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acid molecules, pa«icu,ar>y the UTRs of RNA, in divergent species is likely to be an indicator of 
conserve regulatory elements .ha, are also likely to have a secondary structure. Tne resuUs of 
the imerspecies sequence comparison can he analyzed using MS Excei and visual basic tools ,n 
an entirely automated manner as known to those skilled in the art. 

Paragraph beginning at page 23, line 22 of the specification has been amended as follows: 
In another embodiment of the invention, secondary structure analysis is performed by self 
complementarity comparison. Preferably, self complementarity comparison is performed »s,ng 
Compare, described above. More preferably, Compare can be modified to expand the pamng 
matrix to account for G-U or U-0 basepairs in addition to the conventional Watson-Cnck G- 
C/C G or A-U/U-A pairs. Such a modified Compare program (modified Compare) begms by 
predicting all possible base-pairings within a given sequence. As described above, a 
conserved region, preferably a UTR, is identified based on primary sequence companson of a 
series of orthologs. In modified Compare, each of these sequences is compared to >.s own 
reverse complement. [Figure 7 depicts an exemplary self complementarity analysis.] Allowable 
base-pairings include Wafson-Crick A-U, G-C pairing and non-canonical G-U pamng. An 
overlay of such self complementarity plots of al, available orthologs, and selection for the most 
repetitive pattern in each, results in a minimal number of possible folded configurations. (Rgure 
8 shows an exemplary overlay.] These overlays can then used in conjunction with addmona, 
constraints, including those imposed by energy considerations described above, to deduce the 
most likely secondary structure. 

Paragraph beginning at page 24, line 17 of the specification has been amended as follows: 
In one embodiment of the invention, secondary structure analysis is performed by ahgnmen. 
and covariance analysis. Numerous protocols for alignment and covariance analysis are known to 
mose skilled in the art. Preferably, alignment is performed by C,usta.W, which is avatla e an 
known to those skilled in the art. ClustalW is a tool for multiple sequence alignment that, ahhougl, 
no, a par. of GCG, can be added as an extension of the existing GCG too, se, and used wi,h local 
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sequences. IClustalW can be accessed <hrou E h «he Interne, at, for example, 
httpV/dot.imgen.b.m.unc.edu^SSl/mul.i-angn/Options/ctetalw.h.ml.] Hns. alW is . publ.cW 

a ^ !!s _i! MS b *e * r 21 T^f^" 

,, |1 , irr -, Hn . QM1 , r .. | .^ r .^^,.l„.. al w.h, ml . ClustalW is also descnbe4 m 

Thompson,«a ; .,N U , A c iifa to.,1994,22,4673-4680,*chi si ncorpora tt dhereinbyrefe re nce,n 

i,sen,ire.y These processes can be : scripted to automatically use conserved UTR regions identified* 
earlier steps. Seqed, a UNIX command line interface availableand known to those skilledmrne art, 
allows extraction of selected local regions from a larger sequence. Multiple sequences from many 
different species can be clustered and aligned for further analysis. 

Paragraph beginning a. page 25, line 1 1 of the specification has been amended as follows, 
Covariation is a process of using phylogenetic analysis of primary sequence information for 
c onsensussecondarys«.c.ure prediction. Covariation is described in me following referees, each 
of which isincorpora.edhereinbyreferenceinmeiren,ire,y : Gu,=,l,e,. ( ., "Comparative Sequence 
AnalysisOfExperimentsPerformedDuringEvo.u.ion-InRibosomalRNAGroupIIn^s.Green, 

Ed Austin:Landes, 1996';'Gautheret, et at, Nuc. Acids Res., \997, 25, 1 559-1564; Gautheret, et °^" 
RNA 1995 1, S07-8U; Lodmell, e, a,., Proc. M Acad. Sci. USA, 1995. 92, 10555-10559; 
Gautheret, * ai. J. Mol Biol, 1995, 248, 27-43; Guiell, Nuc. Acids Res., 1994, 22, 3502-35H; 
'Gutell Nuc. Acids Res., 1993, 21, 3055-3074; Gutell, Nuc. Acids Res., 1993, 21, 3051-3054; Woese, 
Proc Natl Acad. Sci. USA,m9,86,in^n2-,md^ X ,e,aL,Nuc.AcidsRes.,i9m,8,2215- 
2293 Preferably.covariance software is used for covariance analysis. Preferably, Covariation, a set 
of programs for the comparative analysis of RNA structure from sequence alignments. ,s used. 
Covanation uses phylogenetic analysis of primary sequence information for consensus secondary 
structure prediction. [Covariation can be obtained through the Interne, at, for example, 
h, tp 7/www.mbio.ncsu.edu/RNaseP^ 

te^i^lhe, intact « Hr world Mr * 

lMm | ||Tr -^. f ^ r Wn m , ra ms.html. A complete description of a vers.on of the 

program has been published (Brown, J. W. 1991 Phylogenetic analysis of RNA stiuCure on .he 
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types of covariation analysis torn RNA sequence alignments, including standard c—n 
analysis, the identification of compensatory base-changes, and mutual information ana,ys,s. The 

program; it does not requi K HyperCard (although a much smailer 'stack' version is included). Th,s 
program will run in any Macintosh environment running MacOS v7.i or higher. Faster processor 
machines(68040or PowerPC) is suggested for mutual information analysis or the analysis oflarge 

sequence alignments. 

Paragraph beginning at page 26, line 5 of the specification has been amended as follows: 
In another embodiment of the invention, secondary structure analysis is performed by 
secondary structure prediction. There are a number of algorithms that predict RNA secondary 
structures based on thermodynamic parameters and energy calculations. Preferably, secondary 
srructure prediction is performed using either M-fo,d or RNA Structure 2.52. [M-fold can be 
accessed^ughuteMemetat.^^^ 

J— f ^orcanbedow„loadedfor,ocaUseonU>r K platforms.M-fp.disa.soava 1 lab.e 
as a part of GCG package. RNA Structure 2.52 is a windows adaptation of the M-fold algorithm and 
[c an be accessed through thelnteme, at, forexample, h „p://128.15U76.70/RNAs tt uc.ure.h m ,] !S 
^HM,, ,| tr 176 7<VPNM<n.cture.h,m l. 

Paragraph beginning at page 26, line 29 of the speoif.ca.ion has been amended as follows: 
,„ another preferred embod.men« of the invention, the output of AlignHits is read by a 
program called RevComp. A block diagram of this program is shown in Figure 9 [14]. Th,s 
program could be .produced by one skMed in the art. A preferred purpose of this program ,s to 
use base pairing rules and ortholog evolution to predict RNA secondary structure. RNA 
secondary structures arc composed of single stranded regions and base paired regions, cailed 
stems Since structure conserved by evolution is searched, the most probable stem for a g.ven 
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alignment of ortholog sequences is the one which could be formed by the most sequences. 
Possible stem formation or base pairing rules is determined by, for example, analyzing base 
pairing statistics of stems which have been determined by other techniques such as NMR. The 
output of RevComp is a sorted list of possible structures, ranked by the percentage of ortholog set 
member sequences which could form this structure. Because this approach uses a percentage 
threshold approach, it is insensitive to noise sequences. Noise sequences are those that either not 
true orthologs, or sequences that made it into the output of AlignHits due to high sequence 
homology even though they do not represent an example of the structure which is searched. A 
very similar algorithm is implemented using Visual basic for Applications (VBA) and Microsoft 
Excel to be run on PCs, to generate the reverse complement matrix view for the given set of 
sequences. 

Paragraph beginning at page 28, line 13 of the specification has been amended as follows: 
After the hypothetical structure motifs-are determined from the secondary structure 
analysis described above, a family of structure descriptor elements is constructed. Preferably, the 
structural motifs described above are converted into a family of descriptor elements. An 
exemplary descriptor element is shown in Figure 6 [9]. One skilled in the art is familiar with 
construction of descriptors. Structure descriptors are described in, for example, Laferriere, et al., 
Comput. Appl. Biosci., 1994, 10, 211-212, incorporated herein by reference in its entirety. A 
different structure descriptor element is constructed for each of the structural motifs identified 
from the secondary structure analysis. Briefly, the secondary structure is converted to a generic 
text string, such as shown in Figure 6 [9]. For novel motifs, further biochemical analysis such as 
chemical mapping or mutagenesis may be needed to confirm structure predictions. Descriptor 
elements may be defined to have various stringency. 

Paragraph beginning at page 28, line 24 of the specification has been amended as follows: 
For example, referring to Figure 6 [9], the region termed HI, which comprises the first 
region of the stem, can be described as NNN:NNN, which contemplates any complementary base 
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pairingincludngG-C.C^.A-U.andU-A. The HI region may also be designated so as to 
include only C-G or A-U. etc.. base pairing. In add.tion, the descriptor elements can be defined 
,„ allow for a wobble. Thus, descriptor elements can be defined to have any level of stringency 
desired by the user. Applicants" invention, thus, is also directed to a database compnsmg 
different descriptor elements. 

Paragraph beginning at page 29, line 10 of the speaficauon has been amended as follows: 
In one embodiment of the invention, nucleic acids having secondary structure wtach 
correspond to the structure descriptor elements are identified by searching at leas, one database. 
Any genetic database can be searched. Preferably, the database is a UTR database, which „ a 
compilation of the untranslated regions in messenger RNAs. A TJTR database [is accesstble 
through the Internet at, for example, ft p://area.ba.cnr. W puWembn=^da.abase/utr/l i^Msk 

dajabaseAur/. Preferably the database is searched using a computer program, such as, for 
example Rnamot, a UNK-based motif searching tool available from Daniel Gautheret. Each 
■new" sequence that has ffie same motif is then queried against public domain databases to 
identify addidona. sequences. Results are analyzed for recurrence of pattern in UTRs of these 
additional oriho.og sequences, as described below, and a database of RNA secondary structures 
is built One skilled in the art islfamiliar with Rnamot. Briefly, Rnamot takes a descriptor string, 
such as the one shown in Figure fi 19], and searches any Pasta forma, database for posstble 
matches Descriptors can be very specific, to match exact nucleotide^), or can have bu,lt-,n 
degeneracy. Lengths of the stem and loop can also be specified. Single stranded loop regtons 
can have a variable length. G-U pairings are allowed and can be specified as a wobble parameter. 
Allowable mismatches can also be included in the descriptor definition. Functional sigmficance 
is assigned to the motifs if their biological role ,s known based on previous analysts. Known 
regulatory regions such as Iron Response Element have been found using this techmque (see, 
Example 1 below). In embodiments of the invention in which a database containing prokaryouc 
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modular interaction site, is compiled, it is preferable to refrain from searching human 
sequences or, alternatively, discarding human sequences when found. 

Paragraph beginning a, page 32, line 26 of the specification has been amended as follows: 
An early step in the process is to use the master sequence (nucleotide or protem) to find 
and rank related sequences in the database (orthologs and paralogs). Sequence similanty search 
algorithms are used for this purpose. All sequence similarity algonthms calculate a quanutanve 
measure of similarity for each result compared with the master sequence. An example of a 
quantitative result is an E-value obtained from the Blast algorithm. The E-values for a blast 
search of the non-redundant GenBank database using ferritin mRNA as the query sequence 
illustrates the use of quantitative analysis of sequence similarity searches. The E-value ,s the 
probability that a match between a query sequence and a database sequence ccurs due to random 
chance Therefore, the lower an E-value the more likely that two sequences are truly .elated. A 
plot of the lowest E-value scores for ferritin is shown in Hgure 1 [10]. Sequences that meet the 
cutoff criteria are selected for more detailed comparisons according to a set of rules desenbed 
below Since an object of the sequence similarity search to find distantly related orthologs 
and paralogs, it is preferabie that the cutoff criteria no. be tec stringent, or the target of the search 
may be excluded. 

Paragraph beginning at page 33, line 23 of the specification has been amended as follows: 
When the human mRNA and mouse mRNA sequences for ferritin, which each contam an 
IRE in the S'-UTR, are analyzed in this manner, a plot showing the regions of sequence sirmlanty 
is produced!, as shown in Figure 19]. Pairwise analysis of the human and mouse ferritin mRNA 
sequences illustrate several important aspects of mis type of analysis. Regions of each mRNA 
mat encode the amino acid sequence have the highest degree of similarity, while the untranslated 
regions are less similar. [In Figure 19, the location of the IRE is indicated.) In bom (he human 
and mouse ferritin mRNAs the IREs are located in the extreme 5'end of each mRNA. Tins 
demons.ra.es.an important point - the sequence conservation in the region of me IRE structure 
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does not stand out against the background of sequence similarity between the human and mouse 
ferritin sequences. In contrast, in the comparison of human and trout [(Figure 1 1)] or human and 
chicken [(Figure 12)] ferritin mRNAs, the IREs can be immediately identified. This is because 
the sequence of the UTRs between human and trout or human and chicken are separated by 
greater evolutionarily distance than human and mouse, which is logical in view of the 
evolutionary distance that separates humans from birds and fish compared with other mammals. 
Comparing the human sequence to that of birds and fish is informative because the natural drift 
due to evolution has allowed many sequence changes in the UTRs. However, the IRE sequences 
are more constrained because they form an important structure. Thus, they stand out better and 
can be more readily identified. 

Paragraph beginning at page 34, line 28 of the specification has been amended as follows: 
The software used in the present invention makes the decision whether or not to compare 
sequences pairwise using a lookup table based upon the evolutionary distances between species. 
An example of a small lookup table using the examples described above is shown in Figure 8 
[13]. The lookup table iri'the present invention includes all species that have sequences 
deposited in GenBank. Q-Compare in conjunction with CompareOverWins decides which 
sequences to compare pairwise. 

Paragraph beginning at page 35, line 4 of the specification has been amended as follows: 
Sets of sequences that show evidence of conservation in orthologs and paralogs or other 
related genes are analyzed for the ability to form internal structure. This is accomplished by 
analyzing each sequence in a matrix where the seqeunce is plotted 5' to 3' on the X axis and its 
reverse complement is plotted 5' to 3' on the Y axis, such as in, for example, self-complementary 
analysis. Matches that correspond to potential intramolecular base pairs are scored according to a 
table of values. When the human ferritin IRE sequence is analyzed in this fashion, the diagonals 
indicate potential self- complementary regions. Each of the 13 IRE sequences described in this 
example were analyzed in the same fashion. While each of the sequences can form a variety of 
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different structures, the structure most likely to occur is one common to all the sequences. By 
superimposing the plots of all 13 individual sequences [(see, Figure 8)], the potential structure 
common to all the sequences is deduced. 

Paragraph beginning at page 36, line 5 of the specification has been amended as follows: 
[The use of] Hovergen was used to identify related sequences wtthe species and order 
levels [is shown in Figure 16 (tree classification at the species level) and Figure 17 (classification 
at the order level)]. Sequences corresponding to each of these orthologs was saved in GenBank 
format and grouped together in a single data file. Untranslated regions in both the 5' and 3' 
flanks of the coding region was extracted using SEAU5 and COWX, as shown in Figure 11 [18]. 

Paragraph beginning at page 36, line 1 1 of the specification has been amended as follows: 
The IRE sequences are more constrained because they form an important structure.' Thus, 
they stand out better and can be more readily identified even in closely related sequences. 
However, for this to work for any gene, the compare algorithm has been rewritten (see, Figures 
5A-C). This new tool, Co'mpareOverWins, allows a dynamic selection of both the range of 
window sizes, as well the hit threshold. This algorithm needs as its input parsed and separated 5' 
and 3' UTR sequences. We use tools available within the Seals genome analysis package 
described earlier to achieve this. Figure U [18] describes the steps involved. 

Paragraph beginning at page 36, line 19 of the specification has been amended as follows: 
To identify the IRE [iron responseve element] using the methods described herein, the 
compare over windows [widows] algorithm was used and the results visualized using AlignHits 
(Figure 5D for the algorithm). [Representative results are shown in Figure 23.] In addition to 
optimizing the thresholding, CompareOverWins also extracts the sequence corresponding to the 
hits. ClustalW (version 1.74) was used on the extracted sequences to create a locally gapped 
alignment [(see, Figure 24)]. A representative flow scheme for this approach is shown in Figure 
13 [25]. 
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Following ex.raction and comparison by SEALS and COWX, A.ign Hits was used <o 

define po,en.ial.y in— re^ons t(«, Fi gUr e31). One such region is shown encrded,. 

The sequent cording to ,he region o, interest was extracted from a„ spec.es for 

alignment with CLUSTAL W (1.74). Following extraction of sequence information from ^ 

hT CLUSTAL W (1.74) was used to provide multiple sequence alignment shown [<*.. F.gure 

32),' Each of the putative hi. sequences, was analyzed for the ability to form interna, structure. 

This was accomplished by analog each sequence in a matrix where the sequence was pitted 5 

,o 3-on the X axis and to complement is plotted 5' to 3' on the Y axis. Base-pairs along the 

diagonals indicate potential self-complemenury regions that can form secondary structures. 

[Figure 33 shows a representative reverse complement matrix. Figure 34 shows a] A 

representative sequence alignment in a dome forma, ^ Rowing) potential stem 

formation between the base pairs. Following conversion of the dome forma, file to a c. fle, 

RNA Structure 3.21 is used to visualize the structure {(.see, Figure 35)]. 

Paragraph beginning at page 38, line 10 of the specification has been amended as follows-. 

Vimentin is an intermediate flamen, protein whose 3U1* is highly conserved between 
spE cies Previous studies by Zehner e, a.., (Nuc. Acids Res., 1997, 25, 3362-3370) has shown 
1 a proposed a complex s,em-.oop stnrcture contained within this region may be important for 
vimentin mRNA functiorls such as mRNA location. The same region was identified usmg 
the present analysis, thus validating the present approach. In addition, based on the analyses 
described herein, a second stem-loop structure that occurs downstream of the previously . 
proposed structure that may have a role in regulating vimentin fuction as well has been .dentified 
[(see. Figure 36)]. 

Paragraph beginning a. page 38, line 18 of the specification has been amended as follows: 
A representative phylogenetic tree output for all Vimentin orthologs in Hovergen 
database w^^d [is shown in Figure 37). Each of these or.ho.ogs was saved in GenBanlc 
forma, and grouped together in a single data file. Untranslated regions in both me 5 and 3 
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flanks of the coding regions were extracted and compared using SEALS and COWX a S described 

earlier (see, Figures 11 [18] and 13 [25]). 

Paragraph beginning at page 38, line 23 of the specification has been amended as follows: 
Following extraction and comparison by SEALS and COWX, Align Hits was used to 
determine potentially interesting regions. Two such regions appeared, and were used for 
subsequent analyses [(see, Figure 38)]. Following extraction of sequence information from Ahgn 
Hits for thefirst region [1], CLUSTAL W was used to provide multiple sequence alignment 
[shown (see, Figure 39)]. Potential stem formation between base pairs was [is] given above the 
sequence alignment in a dome format [is shown in Figure 40.] Following conversion of the 
dome format file to a ct file, RNA Structure 3.21 was used to visualize the structure [(**, Figure 
41)] This structure is very similar to the one proposed by Zehner et al. [(see, Figure 42)]. 
Zehner et al. presented a detailed chemical analysis of their proposed structure for the minimal 
binding domain in the 3' UTR of Vimentin. This analysis included cleavage with single-sfrand- 
specific (ChS or Tl) or double-strand-specific (VI) nucleases as well as after exposure to lead 
acetate. 

Paragraph beginning at page 39, line 5 of the specification has been amended as follows: 
Following extraction of sequence information from Align Hits for thesecond region [2], 
CLUSTAL W was used to provide multiple sequence alignment [shown in Figure 43]. The 
potential stem formation between base pairs in thesecond region [2] was [is] given above the 
sequence alignment in a dome format [(see, Figure 44)]. Following conversion of the dome 
format file to a ct file, RNA Structure 3.21 was used to visualize the structure for the second 
region [2 (see, Figure 36)]. 

Paragraph beginning at page 39, line 18 of the specification has been amended as follows: 
A representative phylogenetic tree output for all Transferrin receptor orthologs in 
Hovergen database w obtained [is shown in Figure 45]. Each of these orthologs was saved « 
GenBank format and grouped together in a single data file. Untranslated regions in both the 5' 
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and 3* flanks of the coding region were extracted and compared using SEALS and COWX as 
described earlier (see, Figures ii [18] and 13 [25]). 

Paragraph beginning at page 39, line 23 of the specification has been amended as follows: 
Following extraction and comparison by SEALS and COWX, Align Hits was used to 
determine potentially interesting regions, [as shown in Figure 46. This can be seen where a 
vertical line intersects a series of horizontal lines representing sequence information from a set of 
species. This] The first region, between base pairs 920 to 990 a in the 3 prime UTR of transferrin 
receptor was extracted from all species for alignment with CLUSTAL W (1.74). 

Paragraph beginning at page 39, line 28 of the specification has been amended as follows: 
Following extraction of sequence information from Align Hits for the first region [1], 
CLUSTAL W (1,74) was used to provide multiple sequence alignment [as shown in Figure 47]. 
A representative potential stem formation between base pairs was [is] given above the sequence 
alignment in a dome format [as shown in Figure 48]. Following conversion of the dome format 
file to a ct file, RNA Structure 3.21 was used to visualize the structure, [(see, Figure 49). This 
can be seen where a vertical line intersects a series of horizontal lines representing sequence 
information from a set of species. This] The second region^ between base pairs 990 to 1050 a in 
the 3 prime UTR of transferrin receptor was extracted from all species for alignment with 
CLUSTAL W (1.74) [{see, Figure 50)]. 

Paragraph beginning at page 40, line 7 of the specification has been amended as follows: 
Following extraction of sequence information from Align Hits for the second region [2], 
CLUSTAL W (1.74) was used to provide multiple sequence alignment [as shown in Figure 51]. 
Potential stem formation between base pairs was [is] given above the sequence alignment in a 
dome format [as shown in Figure 52]. Following conversion of the dome format file to a ct file, 
RNA Structure 3.21 was used to visualize the structure [as shown in Figure 53]. Following 
extraction and comparison by SEALS and COWX, Align Hits was used to determine potentially 
interesting regions. [This can be seen where a vertical line intersects a series of horizontal lines 
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representing sequence information from a set of species. This] The third region^ between base 
pairs 1372 to 1423 A in the 3 prime UTR of transferrin receptor was extracted from all species for 
alignment with CLUSTAL W (1.74) [(see, Figure 54)]. 

Paragraph beginning at page 40, line 17 of the specification has been amended as follows: 
Following extraction of sequence information from Align Hits for the third region [3], 
CLUSTAL W (1. Ex.34) was used to provide multiple sequence alignment [as shown in Figure 
55]. Potential stem formation between base pairs was [is] given above the sequence alignment in 
a dome format [as shown in Figure 56]. Following conversion of the dome format file to a ct 
file, RNA Structure 3.21 was used to visualize the structure [as shown in Figure 57]. Following 
extraction and comparison by SEALS and COWX, Align Hits was used to determine potentially 
interesting regions. [This can be seen where a vertical line intersects a series of horizontal lines 
representing sequence information from a set of species. This] The fourth region^ between base 
pairs 1439 to 1479 A in the 3 prime UTR of transferrin receptor was extracted from all species for 
alignment with CLUSTAL W (1.74) [(see, Figure 58)]. 

• 9 

Paragraph beginning at page 40, line 27 of the specification has been amended as follows: 
Following extraction of sequence information from Align Hits for the fourth region [4], 
CLUSTAL W (1. Ex.34) was used to provide multiple sequence alignment [as shown in Figure 
59]. Potential stem formation between base pairs was [is] given above the sequence alignment in 
a dome format [is shown in Figure 60]. Following conversion of the dome format file to a ct file, 
RNA Structure 3.21 was used to visualize the structure [as shown in Figure 61]. Following 
extraction and comparison by SEALS and COWX, Align Hits was used to determine potentially 
interesting regions. [This can be seen where a vertical line intersects a series of horizontal lines 
representing sequence information from a set of species. This] The fifth region,, between base 
pairs 1479 to 1542^ in the 3 prime UTR of transferrin receptor was extracted from all species for 
alignment with CLUSTAL W (1.74) [(see, Figure 62)]. 
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Paragraph beginning at page 41, line 6 of the specification has been amended as follows: 
Following extraction of sequence information from Align Hits for the fifth region [5], 
CLUSTAL W (1. Ex.34) was used to provide multiple sequence alignment [as shown in Figure 
63]. Potential stem formation between base pairs was [is] given above the sequence alignment in 
a dome format [is shown in Figure 64]. Following conversion of the dome format file to a ct file, 
RNA Structure 3.21 was used to visualize the structure [as shown in Figure 65]. 

Paragraph beginning at page 41, line 12 of the specification has been amended as follows: 
Orinithine decarboxylase (ODC) is the first enzyme in the polyamine biosynthetic 
pathway. Studies have shown existence of translational regulatory elements both in the 5' and 3' 
untranslated regions (Grens et al., J. Biol. Chem., 1990, 265, 11810). Secondary structures have 
been proposed to exist in both these regions, though there is no conclusive evidence for it. The 
methods described herein identified two structures in the 3'UTR, as shown below. The presence 
of one of these structures (see, Figure 15 [66]) was verified using mass spectrometry probing 
(Griffey, et al., Proc. SPIE-Int. Soc. Opt. Eng., 2985 (Ultrasensitive Biochemical Diagnostics II): 
82-86, which is incorporated herein by reference in its entirety). Two representative sequences 
that showed slight variation in their lengths [(see, Figure 67)] were made into RNA and subjected 
to MS structure probing. Results shown in Figure 15 [66] confirm the presence of a stem-loop 
structure. Accordingly, identification of a novel secondary structure can be identified from the 
methods described herein, and such existence has been independently verified by structure 
probing. 

Paragraph beginning at page 41, line 25 of the specification has been amended as follows: 
Phylogenetic tree outputs for all Ornithine Decarboxylase orthologs in Hovergen database 
were obtained [is shown in Figure 68 and Figure 69]. Each of these orthologs was saved in 
GenBank format and grouped together in a single data file. Untranslated regions in both the 5' 
and 3' flanks of the coding region were extracted and compared using SEALS and COWX as 
described earlier (see, Figures 11 [18] and 13 [25]). 
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Paragraph beginning at page 42, line 1 of the specification has been amended as follows: 
Following extraction and comparison by SEALS and COWX, Align Hits was used to 
determine potentially interesting regions [as shown in Figure 70]. Two such regions appeared 
[appear], and were used for subsequent analyses. Following extraction of sequence information 
from the first region [1], CLUSTAL W (1.74) was used to provide multiple sequence alignment 
shown. Each of the putative hit sequences was analyzed for the ability to form internal structure 
[as shown] in a [the] reverse complement matrix [depicted in Figure 71]. This was accomplished 
by analyzing each sequence in a matrix where the sequence is plotted 5' to 3' on the X axis and its 
complement is plotted 5' to 3' on the Y axis. Base-pairs along the diagonals indicate potential 
self- complementary regions that can form secondary structures. Domes view of the potential 
stem formation between base pairs in region 1 is given above the sequence alignment was 
determined using RevComp [(see, Figure 72)]. RNA Structure 3.2 was used to visualize the 
structure [(see, Figure 73)]. 

Paragraph beginning at page 42, line 13 of the specification has been amended as follows: 
Mass spectrometry analyses techniques were used to probe for structure. The cluster 
alignment of the first region of ornithine decarboxylase 3' UTR [Figure 67] showed presence of 
gaps/inserts in the multiple alignment. Two representative RNAs (gi404561 and gi35135) from 
the alignments [shown in Figure 67] were used for this experiment. Analysis of the pattern of 
induced fragmentation showed a very strong likelihood for base-paring along the top half of the 
stem-loop structure [(shown inverted in the figure)]. This corresponds to bases 1 1-14 and 20-23 
in 404561 or bases 8-1 1 and 18-21 in 35135. Bulged bases (G9 in 404561 or U22 in 35135) also 
showed characteristic fragmentation pattern. The bottom-half of the structure appeared to be less 
stable, and showed some fragmentation where our analyses had predicted base-paring. This was 
particularly true in the sequence 35135. This region, however, has several contiguous A-U or G- 
U base-pairs which tend to be less stable, and therefore have a higher probability of 
fragmentation. 
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Paragraph beginning at page 42, line 24 of the specification has been amended as follows: 
Following extraction of sequence information from Align Hits for the second region [2], 
CLUSTAL W was used to provide multiple sequence alignment [shown as shown in Figure 74]. 
Potential stem formation between base pairs in the second region [2] was [is] given above the 
sequence alignment in a dome format [as shown in Figure 75]. Following conversion of the 
dome format file to a ct file, RNA Structure 3.21 was used to visualize the structure for the 
second region [2 as shown in Figure 76], 

Paragraph beginning at page 43, line 2 of the specification has been amended as follows: 
A representative phylogenetic tree output for all IL-2 orthologs in Hovergen database was 
obtained [is shown in Figure 77]. Each of these orthologs was saved in GenBank format and 
grouped together in a single data file. Untranslated regions in both the 5' and 3' flanks of the 
coding region were extracted and compared using SEALS and COWX as described earlier {see, 
Figures H [18] and 13 [25]). 

Paragraph beginning at page 43, line 7 of the specification has been amended as follows: 
Following extraction and comparison by SEALS and COWX, Align Hits was used to 
determine potentially interesting regions in the 3*UTR region. Two such regions appear, and were 
used for subsequent analyses [(see, Figure 78)]. Following extraction of sequence information 
from Align Hits for the first region [1], CLUSTAL W (1.74) was used to provide multiple 
sequence alignment [shown in Figure 79]. Domes view of the potential stem formation between 
base pairs in the first region [1 is] was given above the sequence alignment [was determined] 
using RevComp [{see, Figure 80)]. RNA Structure 3.2 was used to visualize the structure [as 
depicted in Figure 81]. Following extraction of sequence information from Align Hits for the 
second region [2], CLUSTAL W (1.74) was used to provide multiple sequence alignment [shown 
in Figure 82]. Potential stem formation between base pairs in the second region was [2 is] given 
above the sequence alignment in a dome format [as shown in Figure 83]. Following conversion 
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of the dome format file to a ct file, RNA Structure 3.21 was used to visualize the structure for the 
second region [2 as shownln Figure 84]. 



Paragraph beginning at page 43, line 20 of the specification has been amended as follows: 
In addition to the two regions described above, a third region, downstream of, and 
partially overlapping the second region [2], was identified using an alternate reference sequence 
(3087784.fa) [and is shown in Figure 85]. Following extraction of sequence information from 
Align Hits for this region, CLUSTAL W (1.74) was used to provide multiple sequence alignment 
[shown in Figure 86]. Potential stem formation between base pairs in the third region [3 is] was 
shown [in Figure 87] above the sequence alignment in a dome format. Following conversion of 
the dome format file to a ct file, RNA Structure 3.21 was used to visualize the structure for the 
third region [3 {see, Figure 88)]. 

Paragraph beginning at page 43, line 29 of the specification has been amended as follows: 
Representative phylogenetic tree output for all IL-4 orthologs in Hovergen database was 
obtained [is shown in Fig'Ore 89]. Each of these orthologs was saved in GenBank format and 
grouped together in a single data file. Untranslated regions in both the 5' and 3* flanks of the 
coding region were extracted and compared using SEALS and COWX as described earlier {see, 
Figures U [18] and 13 [25]). \ 

Paragraph beginning at page 44, line 4 of the specification has been amended as follows: 
Following extraction and comparison by SEALS and COWX, Align Hits was used to 
determine potentially interesting regions in the 5TJTR region [as shown in Figure 90]. Following 
extraction of sequence information from Align Hits for the above region, CLUSTAL W (1.74) 
was used to provide multiple sequence alignment [shown in Figure 91]. Domes view of the 
potential stem formation between base pairs in the region was [is] given above the sequence 
alignment [was determined] using RevComp [{see, Figure 92)]. RNA Structure 3.2 was used to 
visualize the structure [as shown in Figure 93]. 
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Paragraph beginning at page 44, line 11 of the specification has been amended as follows: 
[Figure 94 depicts a representative] Align Hits was used to view [of] hits in the 3TJTR 
region of IL-4. Following extraction of sequence information from Align Hits for the 3'UTR 
region, CLUSTAL W (1.74) was used to provide multiple sequence alignment [as shown in 
Figure 95]. Potential stem formation between base pairs in the second region [2 is] was given 
above the sequence alignment in a dome format [is shown in Figure 96]. Following conversion 
of the dome format file to a ct file, RNA Structure 3.21 was used to visualize the structure for the 
second region [2 {see, Figure 97)]. 

In the Claims: 

Claims 35, 43 and 51 have been amended as follows: 
35. (Twice Amended) An oligonucleotide comprising a molecular interaction site that is present 
in the RNA of a selected organism and in the RNA of at least one additional organism, wherein said 
molecular interaction site serves as a binding site for at least one molecule that when bound to said 
molecular interaction site modulates the expression of said RNA in said selected organism, wherein 
said oligonucleotide does'flot comprise the iron response element or the 3' untranslated region of the 
histone mRNA . wherein said molecular interaction site is identified by a method comprising: 

comparing the nucleotide sequence of said RNA of a selected organism with the nucleotide 
sequences if a plurality of nucleic acids from different taxonomic species; 

identifying at least one sequence region which is conserved among said plurality of nucleic 
acids and said RNA of a selected organism [target nucleic acid]; 

determining whether said conserved region has secondary structure; and 

for said conserved region having secondary structure, identifying said secondary structure. 

43. (Amended) The oligonucleotide of claim 35 further comprising comparing said RNA of a 
selected organism [target nucleic acid] to paralogous nucleic acids. 

5 1 . (Amended) An oligonucleotide comprising a molecular interaction site that is present in the 



68 



DOCKET NO.: IBIS-0012 PATENT 

RNA of a selected organism and in the RNA of at least one additional organism, wherein said 
molecular interaction site serves as a binding site for at least one molecule that when bound to said 
molecular interaction site modulates the expression of said RNA in said selected organism, wherein 
said oligonucleotide does not comprise the iron response element or the 3' untranslated region of the 
histone mRNA . wherein said molecular interaction site is identified by a method comprising: 

comparing the nucleotide sequence of said target nucleic acid with the nucleotide sequences 
of a plurality of nucleic acids from different taxonomic species, wherein said nucleotide sequence of 
said RNA of a selected organism is determined by assembling a plurality of expressed sequence tags; 

identifying at least one sequence region which is conserved among said plurality of nucleic 
acids and said RNA of a selected organism; 

determining whether said conserved region has secondary structure; and 

for said conserved region having secondary structure, identifying said secondary structure. 
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